CLUSTER ANALYSIS AND THE INTERLANGUAGE LEXICON



David J. Hill (DAL) 

Abstract 

As part of a research programme investigating the interlanguage
lexicon of Kenyan learners of English, based on the lexical field of 
locomotion, a card sorting task was used to produce raw similarity 
measure data. This was subjected to cluster analysis and 
multidimensional scaling (MDS). This paper suggests that cluster 
analysis/MDS of perceived similarities of selected verbs in a sorting 
task may indicate possible lines of investigation of the lexical knowledge 
of different groups of learners in comparison to that of native speakers. 

1. Research context 

This analysis was carried out as part of my research into the interlanguage (IL) lexicon 
of L2 learners of English in Kenya. By IL lexicon is here meant the lexical knowledge 
of the target language displayed by a second/foreign language learner. It includes both 
what the learner uses productively (which is directly observable) and also what he uses 
receptively (which is only indirectly observable). This working definition takes no
position regarding the psycholinguistic nature of this lexical knowledge or its relation to 
native or other language knowledge. The research objective was to see if there were any 
significant differences in the lexical performance of learners of English from different 
first language backgrounds. 

The research was based on the lexical field of locomotion, which has been investigated in 
some depth by linguists of different persuasions (see for example Leech 1969, Ikegami
1970, Miller 1972, Lyons 1977, Talmy 1975, 1985). Two basic trends can be discerned
in the literature. Motion is analysed in one as a transition between locational states,
while in the other motion itself is seen as fundamental, along with such notions as
source, path and goal. The latter theoretical framework has been adopted here and use
has been made of Talmy's distinction between verbs that incorporate path and motion
and those that incorporate manner and motion. English appears to belong to a group of
languages that make particular use of manner verbs of motion (Talmy 1985).

The results of a story retelling experiment indicated a strong overall preference on the 
part of the Kenyan learners for path-specifying motion verbs, with possible differences 
related to mother tongue (see Hill 1991). Three mother tongues were represented among
the subjects who performed the task discussed in this paper: Luo (32 speakers), Nandi 
(35) and Olunyore (26). The first two are Nilotic languages and therefore distantly 
related; the last is a Bantu language. The subjects were all trainees at two primary
teachers' colleges in Western Kenya and had had at least ten years' education in English.
Native speaker benchmarks were provided by 31 children at two secondary schools in the 
Edinburgh area. 
2. The card-sorting technique



Psychologists have used sorting - also known as direct grouping - as one of several 
methods to investigate the mental lexicon. Typically, subjects are given a set of cards
with words or sentences printed on them and are asked to sort them according to 
similarity of meaning into as many classes as they wish. The method has been tried with 
different classes of words. Clark (1968) used it in a study of English prepositions. 
Miller (1969) tested his subjects with a heterogeneous collection of 48 nouns. 
Fillenbaum and Rapoport (1971) looked at words in a number of lexical fields, such as 
colour terms, verbs of possession and evaluative adjectives. Kellerman (1978) employed
the method with a set of sentences using a single Dutch verb in a variety of literal and 
metaphorical senses. 

The theory behind each experiment has depended on the preferences of the researcher. 
Miller (1969) made the assumption that native speakers would sort nouns according to 
the semantic features they share while overlooking their distinguishing features. By
pooling the data from a group of subjects a measure of similarity of two items could be 
obtained from the number of times the two items were put together in the same pile. 
The higher the number, the greater the degree of adjudged similarity for that group. 
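
The pooling step just described can be sketched in a few lines of Python. This is only an illustration of the counting idea: the function name, the data layout and the three sample sortings are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(sortings):
    """Count, for every pair of items, how many subjects put both in one pile.

    `sortings` holds one entry per subject; each entry is a list of piles,
    and each pile is a list of item labels (an assumed layout).
    """
    counts = Counter()
    for piles in sortings:
        for pile in piles:
            # sorted() gives each unordered pair a single canonical key
            for a, b in combinations(sorted(pile), 2):
                counts[(a, b)] += 1
    return counts


# Hypothetical sortings by three subjects (not data from the experiment)
sortings = [
    [["LIMP", "STAGGER"], ["COME", "GO"], ["WANDER"]],
    [["LIMP", "STAGGER", "WANDER"], ["COME"], ["GO"]],
    [["LIMP", "STAGGER"], ["COME", "GO", "WANDER"]],
]
similarity = cooccurrence_counts(sortings)
print(similarity[("LIMP", "STAGGER")])  # 3
print(similarity[("COME", "GO")])       # 2
print(similarity[("COME", "WANDER")])   # 1
```

Pairs that were never sorted together simply have a count of zero, so the result can be read off directly as one half of a similarity matrix.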

However, feature theory is no longer as fashionable as it was when Miller wrote his
paper. It has come under attack from various quarters - theoretical linguistics (e.g. 
Bolinger 1965, Lyons 1977, Sampson 1979), anthropology (e.g. Tyler 1978) and, more 
recently, cognitive linguistics (Langacker 1987, Lakoff 1987). It has been criticized for 
its reductionist 'atomistic' approach to meaning, which imposes an arbitrary structure in
which there is no self-evident way of showing which senses are more important than 
others. There is also no theoretical limit to the number of features that can be identified 
and we lack a metalanguage to describe some of the components. Nevertheless, it seems 
difficult to carry out any kind of contrastive lexical analysis without making use of some 
kind of semantic features - 'as problem-ridden in theory as [they are] indispensable in 
practice' (Cruse 1986: 22; he prefers the term 'semantic trait'). 

However, cognitive semantics may provide a solution in the form of prototype theory. 
To take a relevant example, each of us has a mental image of a prototypical act of 
running; though this image may differ in some details from person to person and 
between cultures, a prototypical image is essentially unfocused and non-specific (cf. 
Lakoff 1987). Now it could be claimed that in a sorting task we are in effect being
asked to compare our mental images associated with a number of lexical items and that 
this is something we do holistically and not on a point by point basis according to what
Fillmore (1975) calls a checklist theory of meaning. This does not preclude us from
subsequently trying to justify our sorting on the basis of particular shared characteristics. 

All of the studies mentioned were concerned with the judgments of native speakers. The 
present researcher considered that it would be interesting to see what results could be 
obtained using the card sorting technique with groups of learners from different language 
backgrounds. 

The learners were, as already indicated, at a fairly advanced level; they were undergoing 
training to teach in schools using English as the medium of instruction. As for the 
lexical items, the majority (13 out of 20) are to be found in the General Service List. 
Four of the remaining items are in the Cambridge English Lexicon (i.e. within the 
comprehension level of the Cambridge First Certificate). That leaves three items -
STROLL, STAGGER and TIPTOE - which might be considered less common, though
they are in the Longman English Lexicon, i.e. words that learners at this level would be
expected to have at least receptive knowledge of. The full list of items appears below in 
Table 1. 

All the items were inserted in a common sentence frame in order to provide at least a 
minimal meaningful context: 

Juma [V-ED] along the path. 

The items themselves were given prominence by being typed in upper case. Each 
sentence was presented on a card measuring 6cm by 11cm. Individual subjects were
handed a set of cards and asked to arrange them into groups of similar meaning. They 
were told they could make as many groups as they wished and put any number of cards 
in each group. Apart from this they were not given any further explanation of what was 
meant by similarity of meaning. Nearly everybody set about the task quite readily. 
Subjects worked independently but were often in the same room as others doing the same 
or other tasks. They took between three and ten minutes to make their groupings. No 
time limit or other constraint was imposed on them. Some agonized for long periods
over the allocation of one or two cards or, in rare cases, made complete rearrangements.
At the end a few indicated that they were not completely happy with their final choice or 
pointed out that other orderings would have been equally good. The number of piles 
made ranged from 3 to 15, with an overall mean of 7.96 and a standard deviation of 
2.54; the small differences between the group means were not statistically significant. 



3. Cluster analysis 

The similarity measures for each group of subjects can be shown in a half matrix. Table 
1 gives the raw data for the native speakers. This shows that, for example, 26 subjects 
sorted LIMP and STAGGER together, whereas only 3 put COME and WANDER in the 
same pile. It is possible to make interpretations directly from this data, but cluster 
analysis enables this to be done in a more orderly way. 

The origins of this type of analysis can be found in the development of methods of 
numerical taxonomy in botany and zoology which started in the late eighteenth century. 
Cluster analysis is itself the name for a cluster of techniques - mostly formulated by 
mathematical statisticians in the late 1960s and early 1970s as computers came to be 
more widely used - for grouping together entities of any sort (cf. Everitt 1974; Lorr 
1983; Aldenderfer and Blashfield 1984; Kaufman and Rousseeuw 1990). The basic idea
is that entities (which can be individuals or objects) in the same cluster are more like 
each other than they are like those in other clusters. The techniques of cluster analysis
are therefore designed to maximize differences between clusters relative to variation 
within clusters. Similarity or dissimilarity measures (such as those in Table 1) can also 
be regarded as distances between entities. There are two general categories of clustering 
algorithms, hierarchical and partitioning. Hierarchical procedures, which involve the 
construction of tree-like structures, can be either divisive or agglomerative. We shall 
only be concerned here with the last type. 






Table 1: Similarity matrix for native speaker group 
In agglomerative methods the entities with the shortest distance between them are linked
first, then those with the next shortest distance, and so on until all the entities are joined
in one all-embracing group. Thus in single linkage - also known as the nearest- 
neighbour method - groups initially consisting of single individuals are fused according 
to the distance between their nearest members, the groups with the smallest distance 
being fused first. Each successive fusion decreases by one the number of groups. The 
furthest-neighbour or complete linkage method is exactly the opposite in that the distance 
between groups is now defined as that between their most remote pair of individuals. 
Average linkage is a compromise between these two methods. Other methods exist, with 
varying degrees of mathematical complexity, such as Ward's method and the centroid
method. The choice of method depends very much on what the investigator is using 
cluster analysis for, but even so is by no means clear-cut. 
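
The difference between the three linkage rules can be made concrete with a deliberately naive sketch in Python. It is not the CLUSTER procedure of any statistical package; the function name and data layout are hypothetical. The distances are illustrative only: LIMP-STAGGER is set to 5 on the assumption that distance is the number of subjects (31) minus the co-occurrence count (26), and the remaining figures are invented.

```python
def agglomerate(labels, dist, linkage="average"):
    """Naive agglomerative clustering; returns the fusions in order.

    `dist` maps frozenset({a, b}) to the distance between items a and b.
    Each fusion is recorded as (members of the new group, fusion distance).
    """
    def group_distance(g1, g2):
        pair_dists = [dist[frozenset((a, b))] for a in g1 for b in g2]
        if linkage == "single":      # nearest members of the two groups
            return min(pair_dists)
        if linkage == "complete":    # most remote members of the two groups
            return max(pair_dists)
        return sum(pair_dists) / len(pair_dists)  # group average

    groups = [(label,) for label in labels]
    fusions = []
    while len(groups) > 1:
        # pick the pair of groups with the smallest inter-group distance
        d, i, j = min(
            (group_distance(groups[i], groups[j]), i, j)
            for i in range(len(groups))
            for j in range(i + 1, len(groups))
        )
        merged = groups[i] + groups[j]
        fusions.append((merged, d))
        # each fusion reduces the number of groups by one
        groups = [g for k, g in enumerate(groups) if k not in (i, j)] + [merged]
    return fusions


labels = ["LIMP", "STAGGER", "COME", "GO"]
# Illustrative distances (hypothetical apart from the derivation noted above)
dist = {
    frozenset(("LIMP", "STAGGER")): 5,
    frozenset(("COME", "GO")): 10,
    frozenset(("LIMP", "COME")): 30,
    frozenset(("LIMP", "GO")): 29,
    frozenset(("STAGGER", "COME")): 31,
    frozenset(("STAGGER", "GO")): 28,
}
for merged, d in agglomerate(labels, dist, "average"):
    print(merged, d)
```

With these figures all three rules fuse LIMP with STAGGER first and COME with GO second; they differ only in the level at which the two pairs finally join: 28 for single linkage, 31 for complete linkage and 29.5 for average linkage.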

It should be apparent that cluster analysis is essentially exploratory and descriptive in 
nature. Since the investigator is free to choose 

• the type of similarity/dissimilarity measure 

• the clustering algorithm 

• the cut-off point for determining the number of significant clusters (though
'rules' do exist for this)

this gives cluster analysis a great deal of flexibility. It also renders its interpretation
somewhat problematic. The need for some kind of validation technique is stressed by 
several authorities (e.g. Aldenderfer and Blashfield 1984). The only one which it was 
possible to use here was replication, i.e. splitting each sample into two and performing
cluster analysis on both to check for internal consistency.
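
As a rough illustration of replication, the following Python sketch splits a set of sortings at random into two halves and builds a co-occurrence vector for each. Correlating the two vectors is an assumption made here to give a quick numerical check; the procedure described above was to run the cluster analysis on both halves and compare the results. All names and the sample data are hypothetical.

```python
import math
import random
from itertools import combinations


def pair_counts(sortings, items):
    """Co-occurrence count for every item pair, in a fixed pair order."""
    pairs = list(combinations(sorted(items), 2))
    counts = dict.fromkeys(pairs, 0)
    for piles in sortings:
        for pile in piles:
            for pair in combinations(sorted(pile), 2):
                counts[pair] += 1
    return [counts[p] for p in pairs]


def split_half(sortings, seed=0):
    """Randomly divide the subjects into two halves of (nearly) equal size."""
    shuffled = sortings[:]
    random.Random(seed).shuffle(shuffled)
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation undefined for constant input")
    return cov / (sx * sy)


items = ["COME", "GO", "LIMP", "STAGGER"]
# Hypothetical sortings by six subjects (not data from the experiment)
sortings = [
    [["COME", "GO"], ["LIMP", "STAGGER"]],
    [["COME", "GO"], ["LIMP", "STAGGER"]],
    [["COME"], ["GO"], ["LIMP", "STAGGER"]],
    [["COME", "GO"], ["LIMP", "STAGGER"]],
    [["COME"], ["GO", "LIMP", "STAGGER"]],
    [["COME", "GO"], ["LIMP", "STAGGER"]],
]
half_a, half_b = split_half(sortings)
counts_a = pair_counts(half_a, items)
counts_b = pair_counts(half_b, items)
print(pearson(counts_a, counts_b))
```

A high correlation between the halves would suggest that the pooled similarities are reasonably stable, though it does not by itself guarantee that the two dendrograms would look alike.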

Because of the large amounts of computation involved, cluster analysis is only practical 
with a computer program. I used the CLUSTER program on the SPSS-X statistical 
package. This program can produce several kinds of output, but probably the most 
useful and easiest to understand is the tree graph or dendrogram. Figures 1 and 2 are 
based on the dendrograms produced for two of the groups using the method of group 
average linkage, which avoids the extremes of single and complete linkage - these tend to 
produce chain-like clusters and over-compact clusters respectively - and is recommended 
for general use by, among others, Kaufman and Rousseeuw (1990). 

For the native speakers six clusters of from two to four items stand out quite clearly; 
these merge together with the outlying items to form three large clusters. A similar 
statement could be made about the Luo speakers, except that the content of the clusters 
was in some cases quite different. The grouping of LIMP with STAGGER and 
subsequently with CREEP and TIPTOE is common to both; indeed these two pairs are 
clearly seen in all the groups, but do not coalesce in the case of the Nandi speakers. 
However, for the Luos COME and GO only join up in the larger cluster, while 
RETURN - an outlier in the native speaker group - is closely linked to COME. 



4. Multidimensional scaling 

A more graphic way of presenting cluster analysis data is by means of multidimensional 
scaling (cf. Everitt 1978; Hair et al., 1987). This is a set of techniques related to cluster 
analysis in that they operate on similarity measures, but the results are obtained by a 
totally different algorithm. 

Multidimensional scaling (MDS) programs attempt to find a set of points in a reduced
number of dimensions such that the distances in this lower dimensional space are
monotonically related to the similarities in the matrix. The monotonicity property (by
which a value either never decreases or never increases) cannot in general be completely 
satisfied and a measure called 'stress' is used to assess the extent to which a 
configuration falls short of this requirement. Essentially an MDS program begins from 
an arbitrary initial configuration and proceeds in a stepwise manner making successive
adjustments to the co-ordinates in order to decrease the stress. 
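
To show what a stress value measures, the sketch below computes one common formulation, Kruskal's stress-1, for a given configuration. It is offered under stated assumptions: the paper does not say which stress formula the program used, the input is taken to be dissimilarities (larger means less alike) rather than similarities, and the function names and the three-point example are hypothetical. The sketch evaluates stress only; it does not perform the iterative adjustment of co-ordinates.

```python
import math
from itertools import combinations


def isotonic_fit(values):
    """Pool-adjacent-violators: least-squares non-decreasing fit to `values`."""
    blocks = []  # each block is [sum of values, number of values]
    for v in values:
        blocks.append([v, 1])
        # merge while the previous block's mean exceeds the latest block's mean
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            s, n = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += n
    fitted = []
    for s, n in blocks:
        fitted.extend([s / n] * n)
    return fitted


def kruskal_stress(points, dissim):
    """Stress-1 of a configuration against a table of dissimilarities.

    `points` maps item -> co-ordinate tuple; `dissim` maps frozenset({a, b})
    to a dissimilarity. Zero means the distances are perfectly monotone.
    """
    pairs = sorted(combinations(sorted(points), 2),
                   key=lambda p: dissim[frozenset(p)])
    dists = [math.dist(points[a], points[b]) for a, b in pairs]
    disparities = isotonic_fit(dists)  # best monotone approximation
    num = sum((d - h) ** 2 for d, h in zip(dists, disparities))
    den = sum(d ** 2 for d in dists)
    return math.sqrt(num / den)


# Hypothetical configuration: three items on a line
points = {"A": (0.0, 0.0), "B": (1.0, 0.0), "C": (3.0, 0.0)}
monotone = {frozenset("AB"): 5, frozenset("BC"): 10, frozenset("AC"): 30}
reversed_order = {frozenset("AB"): 30, frozenset("BC"): 10, frozenset("AC"): 5}
print(kruskal_stress(points, monotone))        # 0.0
print(kruskal_stress(points, reversed_order))  # about 0.378
```

In the first case the distances rise in step with the dissimilarities, so the stress is zero; in the second the order is completely reversed and the stress is the square root of 1/7.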

Figures 3, 4, 5 and 6 show MDS configurations for each group of subjects. Notice that 
Figure 6 is in fact a two-dimensional representation of a three-dimensional
configuration, which reduced the stress value quite significantly in this case. It should 
be emphasized that this is simply a way of displaying the data obtained in this 
experiment. It may be tempting to regard lexical items as floating round in some kind of
semantic space in one's head, but an external representation of what someone knows is 
not necessarily equivalent to the internal form of that person's knowledge (Bransford and 
Nitsch 1978). 
Figure 1: Dendrogram for native speaker group using Average Linkage

Figure 2: Dendrogram for Luo speaker group using Average Linkage

Figure 3: Derived two-dimensional stimulus configuration for native speaker group

5. Discussion



As suggested earlier, interpretation of cluster analysis and MDS is not a simple matter.
In the absence of any real validation tests, one must be very cautious about conclusions. 
There is also a good deal of 'noise' in the data and there are several intervening 
variables. Moreover the cognitive demands of this task may not mirror those of 
'ordinary' language processing, so we cannot assume that we are gaining insight into
actual lexical knowledge. Nevertheless the assignment of COME and GO to separate 
clusters in all the learner groups seems very clear. COME forms a kind of path/direction 
group with RETURN and ADVANCE, while GO belongs to a more general group of 
motion verbs - MOVE, PASS and TRAVEL. For native speakers, COME and GO are 
much more closely linked; indeed, two-thirds of the sample sorted them together. 

There are also differences between the three Kenyan language groups which may be
significant. For the two Nilotic language groups ESCAPE tends to go with DASH and
HURRY (though less strongly so in the case of the Nandi) to form a group which might 
be labelled 'rapid movement', while for the Bantu group ESCAPE is linked to CREEP 
and TIPTOE to make a kind of 'secretive movement' group; this would certainly fit the
semantic profile of the corresponding Olunyore verb. ESCAPE is something of an outlier 
for the native speakers, with weak links to HURRY and RETURN. The separation of 
LIMP and STAGGER from CREEP and TIPTOE for the Nandi speakers has already 
been mentioned, although it does not show up so clearly on Figure 5. 

A closer examination of Figure 3 might suggest that the vertical dimension could be
described as one of 'manner' (all the path verbs are in the top half) and the horizontal
dimension as one of 'speed', with RUN and STROLL at the two extremes (though 
COME and GO should perhaps be nearer the middle than they actually are). The 
dimension assignments of the other groups are more difficult to determine. 

It is obvious that further investigation, using larger samples and a wider variety of 
techniques, will be needed to test the generalizability of these observations, together with 
a more profound study of the poorly documented lexicons of the Kenyan languages.
However the results so far are at least not inconsistent with the hypothesis that learners'
IL lexical categories are influenced by those of their mother tongue. 